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COMPLEXITY OF NILSYSTEMS AND SYSTEMS 
LACKING NILFACTORS 

BERNARD HOST, BRYNA KRA. AND ALEJANDRO MAASS 



J>-^, Abstract. Nilsystems are a natural generalization of rotations 

^ ' and arise in various contexts, including in the study of multiple 

ergodic averages in ergodic theory, in the structural analysis of 
topological dynamical systems, and in asymptotics for patterns in 
certain subsets of the integers. We show, however, that many nat- 
ural classes in both measure preserving systems and topological 
dynamical systems contain no higher order nilsystems as factors, 
meaning that the only nilsystems they contain as factors are ro- 
tations. In the ergodic setting, we show that there are spectral 



(~| ' obstructions that give rise to this behavior. In the topological set- 



ting, nilsystems have a particular type of complexity of polynomial 
growth, where the polynomial (with explicit degree) is an asymp- 
totic both from below and above. We also deduce several ergodic 
and topological applications of these results. 

(N 
> 

00 

en . 1. The role of nilsystems 

Q \ 1.1. Nilsystems in ergodic theory. In studying multiple ergodic av- 

CN ■ erages in a measure preserving system (X, i3, /i, T), certain factors with 

algebraic structure occur naturally. (By a measure preserving system, 

we mean a Lebesgue probability space (X, i3, /i, T) endowed with a mea- 

^ \ surable, measure preserving transformation T: X — )■ X.) The simplest 

case is the invariant a-algebra that arises in the von Neumann mean 
ergodic theorem. Assuming henceforth that the system is ergodic, the 
relevant factor for the study of the average of /(T"a;)/(T^"'x) is the 
Kronecker factor Zi{X) = Zi{X,B, fi,T). This is the factor spanned 
by the eigenvalues of the system, and it is isomorphic to a translation 
on a compact abelian group endowed with its Haar measure. 
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For more intricate averages, for example the average of /(T"x) ■ 
/(T^"x) ■ . . . ■ /(T'*"^), we need a more sophisticated structural analysis 
of the system. In [20], this is done by introducing a series of factors 
Zs{X) = Zs{X,B, ^,T) for s > 1. The convergence of this average 
then follows by analyzing the averages separately in the factor Zs{X) 
and in its orthogonal complement. 

Such structural analysis only becomes useful when the factors have 
some sort of geometric or algebraic structure, and this is the content 
of the structure theorem in [20j. More precisely, the factor Zs{X) is 
the inverse limit of all s-step nilsystems that are factors of {X, B, fi, T). 
We call a factor of a system that is a nilsystem a nilfactor (for the 
definition of a nilsystem, see Section [272]) . 

The factors Zs{X) have since been used to understand other multi- 
ple ergodic averages (see, for example, |2T1 [271 113) ^^^ ^'^ prove new 
results on multiple recurrence (for example in |H[I71[T5]). Nilsystems 
arise naturally in other contexts such as additive combinatorics; as an 
example, we cite the recent use of nilsequences (introduced in [3], these 
are functions evaluated on an orbit in a nilsystem) in the work on 
patterns in the primes, as described in the program laid out in |19] . 

Our motivation in starting this work was to give examples of "sim- 
ple" and "natural systems" with explicit, nontrivial factors ^^(X) for 
some s > 1. Since all factors Zs{X), s > 1, are trivial for weakly mix- 
ing systems, we restrict our attention to non-weakly mixing ones. Of 
course, the notion of a "natural system" is not precisely defined, but it 
is clear that a system obtained by building an arbitrary extension of 
a given nilsystem is "artificial." Somewhat surprisingly, we found the 
task of finding non-artificial nilsystems harder than expected. It turns 
out that for many natural classes of systems, the factors Zs{X) coin- 
cide with the Kronecker factor Zi{X); there are spectral obstructions 
that force this, and this is explained in Corollary 13.21 

In Sections 13. 21 and [3. 3[ we give two applications for measure preserv- 
ing systems that have no nilfactors other than rotation factors. The 
first result is on optimal lower bounds for intersections of translates of 
a set, and the second application is a strengthening of results in [22j 
and [7] on the convergence of weighted polynomial multiple averages. 



1.2. Nilsystems in topological dynamics. As is often the case, the 
ergodic questions and results have counterparts in topological dynam- 
ics. As in the ergodic setting, we refer to topological factors that are 
nilsystems as topological nilfactors. Topological nilfactors naturally 
arise in the structural analysis of topological dynamical systems [23] . 
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We recall some definitions (I23j, see also [32j). Let {X,T) be a 
transitive topological system, meaning that X is a compact metric 
space endowed with a homeomorphism T : X — )■ X such that the orbit 
{T^x : n G Z} of some point x G X is dense. We can associate to this 
system an increasing sequence Ztop,s{X), s > 1, of topological factors. 
The first factor Ztop,i(X) is the maximal equicontinuous factor, also 
called the topological Kronecker factor, of (X, T). It is spanned by the 
continuous eigenf unctions of the system, and is topologically isomor- 
phic to a translation on a compact abelian group. For s > 1, Ztop.,{X) 
is the inverse limit of all s-step topological nilfactors of (X, T). In [23j 
and [52], it is shown that topological nilfactors can be characterized by 
dynamically defined "cubic subsets" of X^" , leading to a topological 
analog of the ergodic theoretic structure theorem. 

Finding natural topological systems with nilsystems as topological 
nilfactors also turns out to be nontrivial. One already has obstructions 
that arise from the ergodic setting, by considering any invariant mea- 
sure on the system. Secondly, since nilsystems have zero entropy, it is 
only of interest to consider distal systems with zero entropy. The rele- 
vant property seems to be some sort of complexity, and the notion we 
use is the topological complexity inspired in the notion of e-n spanning 
sets (see for example [35] for the origin of this definition in the work of 
Dinaburg and Bowen): 

Definition 1.1. Let (X, T) be a topological dynamical system and let 
(i be a distance on X defining its topology. For e > and n > 1, an 
e^n spanning set for (X, T) is a finite subset {xi, . . . ,Xm} of X such 
that for every x G X there exists j G {1, . . . , m} such that 

d{T''x, T^Xj) < e for every /c G {0, . . . , n - 1}. 

Let §>x,T,d{^in) denote the minimal cardinality of an e-n spanning set 
of (X, T). When there is no possible ambiguity, we omit the system 
and metric from the notation and write S(£, n). We call the function S 
the topological complexity of the system (X, T). 

Clearly, this notion depends on the choice of the distance d on X. 
However, if d and d' are distances on X defining its topology, then for 
every e > there exist ^i > and 772 > 0, both tending to zero with e, 
such that 

for every n G N. 

We also could have defined this in terms of e-n separated sets: an e-n 
separated set for (X, T) is a finite set {xi, . . . , Xm} of X such that for 
all Xi 7^ Xj there exists /c G {0, . . . , n — 1} such that d(T^Xi, T^Xj) > e. 
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Taking Sx,T,d{£,n) to be the maximal cardinality of an e-n separated 
set in X, we obtain the same behavior as n tends to infinity and e tends 
to 0. This remark is used (Section I5.2.2|) to give an upper bound for 
the complexity of nilsystems. 

The notion of topological complexity used here is closely related to 
the notion of the complexity of a cover studied in [5] and our results 
can be rephrased in this language (see Section HTTj) . 

We show that every nilsystem (X, T) has polynomial complexity 
(Theorem 14. 2p with an explicit degree, and most importantly, the de- 
gree is the same both above and below. This places a constraint on any 
system having such a system as a factor. The upper bound is related 
to the well known fact that in a nilsystem, the orbits of two distinct 
points diverge at a polynomial rate. However, our bound is a global 
invariant, as opposed to such an infinitesimal characterization, and it 
describes the long time behavior. A polynomial upper bound was given 
in [9], but without a clear control on the exponent. 

A natural question is whether a weak converse of Theorem 14. 21 holds: 

Question 1. Let (X, T) be a minimal topological dynam,ical system, 
such that for every e > 0, there exist constants c{e),c'{e) > such that 

c{e)n < §>x,T,d{,£,n) < c'{e)n 

for every n E N and c{e) — ?■ oo as e — t- 0. Is it true that (X, T) is a 
2-step nilsystem ? 

The assumption on the growth of c{e) is needed in order to eliminate 
the possibility that (X, T) is a subshift or, more generally, an expansive 
system. 

1.3. Nilsystems in symbolic dynamics. We recall that a subshift 
over the alphabet A is a closed, shift invariant subset X of A^, where 
A is a finite set and A^ is endowed with the natural compact topology 
and with the shift transformation. Endowed with the restriction of the 
shift, a subshift is a particular type of topological dynamical system. 

A natural question is to decide whether a given subshift has any 
nilfactor other than a rotation. Conversely, given a nilsystem, we can 
ask what kinds of subshifts admit it as a factor? This leads us to a 
notion of complexity classically used for subshifts: 

Definition 1.2 (see for example [12]). Let (X, T) be a subshift on the 
finite alphabet A. For every integer n > 1, let Gx{n) be the number 
of words of length n occurring in X. The combinatorial complexity of 
(X, T) is the function n H- Qx{n). When there is no possible ambiguity, 
we omit the space from the notation and write Q{n). 
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We show (Corollary 14. 3 p that a subshift with very low combinatorial 
complexity does not admit any nilfactor other than a rotation. More 
generally, subshifts with low combinatorial complexity do not admit 
any nilfactor of large order. 

The classical Morse-Hedlund Theorem states that a subshift X is 
finite (and thus only consists of periodic sequences) if and only if Q{n) = 
n for some n. On the other hand, Sturmian systems satisfy C(n) = n+1 
for every n > 1, and such systems are codings of irrational rotations 
on the circle. This leads to the following question: 

Question 2. Find an "optimal" coding of a minimal nilsystem (X, T). 
More precisely, define a subshift {Y, S) having {X, T) as a nilfactor and 
with minimal possible complexity Qy ■ 

1.4. Some questions. The discussion in Section [1.11 leads naturally 
to other questions. In the topological study of nilsystems (Section [L2]), 
notions of complexity play a key role in understanding the existence of 
nilfactors. We ask if something analogous holds in the ergodic setting: 

Question 3. Does the conclusion of Corollary \3.2\ remain valid when 
the spectral hypothesis is replaced by some hypothesis on the "measure- 
theoretic complexity" [TSj or on the "slow entropy" [2^ of the system? 

A related question is: 

Question 4. Compute the "measure-theoretic complexity" or the "slow 
entropy" of ergodic nilsystems. 

The ergodic results of this paper deal with factors. A natural gener- 
alization consists in proving similar results for joinings: 

Conjecture. Let {X, fi, T) be an ergodic system satisfying the spectral 
hypothesis of Corollary \3.^ and let {Y, u, S) be an ergodic nilsystem. 
Then every joining of these two systems is relatively independent with 
respect to the corresponding joining of their Kronecker factors. 

We conclude by returning to our original motivation: 

Question 5. Find "natural" nontrivial systems in any of the settings 
considered here (ergodic, topological or symbolic) that have an explicit 
nilfactor other than a rotation, meaning systems such that Zi{X) ^ 
Z2{X). More generally, describe classes of systems with Zs{X) ^ 
ZsJ^\(X) for some s > 1. 

Acknowledgments. We thank Dave Morris, Terry Tao, and Jean- 
Paul Thouvenot for helpful discussions. 
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2. Notation and basic definitions 

2.1. Measure preserving systems. Throughout the article, we omit 
the (T-algebra of measure preserving systems from our notation and 
write {X,fi,T) instead of {X,B, n,T). All subsets of X and functions 
on X are implicitly assumed to be measurable. If X is given a topolog- 
ical structure, the a-algebra is assumed to be the Borel o"-algebra. For 
simplicity we always assume that the transformation T is invertible. 
When / is a function on X, we write, as usual, Tf instead of / o T. 
Also, in a mild abuse of notation, we use T to denote the unitary 
operator / i— )■ / o T of L'^{fi). 

2.2. Nilsystems. Let G be a group. For a,b E G, the commutator 
(making a conventional choice in the order) of these elements is defined 
to be 

[a, b] = aba'^b^^. 

Throughout, we make use of several standard identities: for a,b E G, 
we have [b, a] = [a, b]^^ and for a,b,c E G, 

(1) [a, be] = [a, b] [b, [a, c]] [a, c]. 

lia E A and A, B are subsets of G, we write [a, B] for the group spanned 
by {[a, &]: b G B} and [A, -B] for the group spanned by {[a, 6]: a G 
A, b E -B}. The subgroups Gj, j > 1, of G are defined inductively by 

Gi = G ; Gj+i = [G, Gj] for every j > 1. 

One can check that 

(2) forallA;,£>l, [G^, G,] C Gfc+,. 

Let s > 1 be an integer. The group G is s-step nilpotent if G^+i = 
{1g}- In particular, G is 1-step nilpotent if and only if it is abelian. If 
s > 2 and G is s-step nilpotent but not (s — l)-step nilpotent, then we 
have 

(3) Gi 2 G2 2 G-a D ■ ■ ■ 2 Gs-i 2 C, ^ {1^}. 

If G is a Lie group, then Go denotes the connected component of its 
unit element l^. 

Let s > 2 be an integer, G be an s-step nilpotent Lie group, and 
r be a discrete and cocompact subgroup of G. The compact manifold 
X = G/T is called an s-step nilmanifold. The group G acts on X by 
left translation, and we write this action as {g, x) \-^ g ■ x. The Haar 
measure /x on X is the unique Borel probability measure fi on X that 
is invariant under this action. 

Let r be a fixed element of G and let T : X — )■ X be the map x i— )■ t-x. 
Then (X, T) is called a topological s-step nilsystem and (X, yU, T) a 
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measure theoretical s-step nilsystem, or just an s-step nilsystem. The 
basic properties of nilsystems were established in [jj and [20], and a 
more modern presentation is found in [25j. In particular, we have the 
equivalences: {X, T) is transitive if and only if it is minimal if and only 
if it is uniquely ergodic if and only if (X, /x, T) is ergodic. 

If (X, T) is minimal then, writing G' for the subgroup of G spanned 
by Go and r and setting F' = F fl G', we have that G = GT. Thus 
(X, T) = (X', V) where X' = G'/T' and T' is the translation by r on 
X'. Therefore, without loss of generality we can restrict to the case 
that G is spanned by Gq and r. 

We can also assume that Gq is simply connected (see for example |2B] 
or [1] for the case that G = Gq and j26] for the general case). 

3. Measure theoretical results: systems without 

nilfactors 

3.1. The spectrum of a nilsystem. Improving a result of Green [Ij, 
Stepin proved (see Starkov [33j for history and comments): 

Theorem ([33]). Let (X = G/F,/x, T) be an ergodic nilsystem with 
connected, simply connected group G. Then L'^{fi) can be written as 
the orthogonal sum -^^(/i) = 1-L (B T-i' of two T -invariant subspaces. 
The space % consists of functions f G -^^(/i) that factorize through 
G/G2F and the restriction ofT to this space has discrete spectrum. The 
restriction ofT to 1-i' has Lebesgue spectrum of infinite multiplicity. 

Green, Stepin, and Starkov only consider the case of a connected, 
simply connected group. But, in view of our applications, we need a 
similar result without any assumption that the group be connected. 
While the extension of Stepin's result to connected, but not simply 
connected groups, is standard, the generalization for non-connected 
groups is harder. Nilsystems arising from non-connected groups can be 
quite different than those arising from connected ones, and there does 
not seem to be a direct method of deducing the general case from the 
particular one. Adapting the existing proofs requires many changes, 
and so instead of modifying existing proofs we give a different one. 

For any s > 2, any s-step ergodic nilsystem that is not a rotation 
admits a 2-step nilfactor that is not a rotation. Thus, we only need 
such a spectral result for 2-step nilsystems: 

Proposition 3.1. Let (X = G/r,n,T) be an ergodic 2-step nilsystem 
that is not a rotation. Then L'^{n) can be written as the orthogonal 
sum L'^{fi) ='H®'H' of two closed T -invariant subspaces such that the 
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restriction of T to Ti has discrete spectrum and its restriction to H.' 
has Lehesgue spectrum of infinite multiplicity. 

The proof is elementary, but lengthy, and so we postpone it to Ap- 
pendix Rl 

Corollary 3.2. Let {X,fi,T) be an ergodic system and assume that its 
spectrum does not admit a Lehesgue component with infinite multiplic- 
ity. Then this system does not admit any nilsystem as a factor, other 
than a rotation factor. 

This result applies, in particular, to 

• Weakly mixing systems. 

• Systems with singular maximal spectral type. 

• Systems with finite spectral multiplicity. This class includes: 

— Systems of finite rank, for example substitution dynam- 
ical systems, linearly recurrent systems, Bratteli-Vershik 
systems of finite topological rank, and interval exchange 
transformations (for definitions and references, see for ex- 
ample [221 ED El El [IB]). 

— Systems of local rank one or of funny rank one (see [TUl [TT] , 
where the definitions are attributed to J. -P. Thouvenot). 

Since nilsystems have zero entropy, the result also applies to 

• Systems whose Pinsker factor belongs to one of the preceding 
types. 

3.2. First application: lower bounds for multiple recurrence. 

Theorem 3.3. Assume that {X,fi,T) is an ergodic system satisfying 
Zs{X) = Zi{X) for all s > 1, for example a system satisfying one 
of the properties listed after Corollary \3.2[ Let pi, . . . , pk be integer 
polynomials satisfying Pi{0) = for 1 < i < k. Then for every A G X 
and every e > 0, the set 

(4) {n e N: /i(AnT~p^("MnT-P2("Mn. . .nT-p'=("M) > fi{A)''+^-6} 

is syndetic. 

Remark. Here and in Theorems 13.41 and 13.5^ the hypothesis can be 
replaced by Z2{X) = Zi{X); it is known that this condition implies 
that Zs{X) = Zi{X) for all s > 1 (this follows, for example, from the 
inclusions ([3|) in Section [2l2|) . 

In [3] it is showed that the conclusion of Theorem 13.31 does not hold 
for non-ergodic systems, even in the simple case of A; = 2, pi{n) = n 
and P2{n) = 2n. The conclusion also fails for general ergodic systems. 
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for example for k > 4 and Pj{n) = jn for I < j < k. In both of these 
cases, the set of integers defined by (jl]) may be empty. 

On the other hand, the conclusion of Theorem 13.31 holds for weakly 
mixing systems [2]. Similar lower bounds for some particular choices 
of polynomials are found in |3l [161 D-^J- 

For convenience, we begin the proof with the case of linear exponents 
and then explain how the method extends to the polynomial case. We 
first show: 

Theorem 3.4. Assume that {X, /i, T) is an ergodic system with Zs{X) = 
Zi{X) for all s > 1, for example a system satisfying one of the prop- 
erties listed after Corollary \3.2[ Then for any integer k > 1, any set 
A G X , and any e > 0, the set 

(5) {n G N: /i(A n T^"A n • . . . • nT^'^^A) > /i(A)^'+^ - e} 

is syndetic. 

Proof. Fix A G X, an integer k > 1, and e > 0. 

Set g = K{1a \ Zi{X)) and notice that < g < 1. Recall that 
the Kronecker factor (Zi(X), z/, T) of {X,fi,T) is a compact abelian 
group, endowed with its Haar measure u, Borel cr-algebra Zi, and the 
transformation T is translation by some element a & Zi. For simplicity, 
in this proof we write Zi instead of Zi{X). For each t ^ Zi, let gt be the 
function given by gt{x) = g{x + t). Then there exists a neighborhood U 
of in Zi such that Hfifi — (71^1(1/) < £/k^ for every t E U. Therefore, for 
t e [/, we have that \\gjt — g\\L'^{u) < j^/k"^ for every integer I < j < k 
and 



k 

g-gf...-gktdu> / /+^ rfz/ - ^ ^ > /i(A) 

Let A := {n G N: na G f/}. Then for every n G A, 

g-T^g-...- T^^'g du > fx{A)''+^ - e. 



>"+' - e. 



Furthermore, A is a syndetic set, and thus there exists an integer L > 
such that every interval of length L in N contains at least one element 
of A. 

On the other hand, by hypothesis, g = E,{1a \ Zs{X)) since Zs{X) = 
Zi for all s > 1. Thus by [3, Corollary 4.6], the difference 

a„ := /i(A n T-^-A n • . . . ■ nr-'="A) - I g-T'g T^^'gdv 



10 BERNARD HOST, BRYNA KRA, AND ALEJANDRO MAASS 

converges to in uniform density, meaning that 

M+N-l 

lim sup — y^ \an\ = 0. 
N^+oo Men N ^ 

n=M 

In particular, the set 

A':= in en-. ^(AnT-Mn...nT-'=M) > I g-T''g-...-T^''gdu-e\ 

has lower Banach density one, meaning that 

lim inf — |A'n[M,M + A^)| = 1. 
N-i-oo Mm N^ ' 

Thus there exists an integer A^ > such that every interval of length 
A^ in N contains L consecutive elements of A'. By definition of L, any 
interval of length A^ in N contains some n G A' fl A and this integer n 
satisfies ([5]). D 

Now we prove Theorem 13. 3[ that is, the extension of Theorem 13.41 
for polynomial iterates. As the proof is similar but notationally more 
cumbersome, we only include an outline of the steps. 

Proof of Theorem \3.3[ Assume that {X, /i, T) is an ergodic system with 
Zs{X) = Zi{X) for all s > 1 and that pi, . . . , pk are integer polynomials 
satisfying Pi{0) = for 1 < i < k. In this proof, we write Zi, Zg, . . . , 
instead of Zi{X), Zs{X), .... 

By [21] and |26] . there exists an integer s > 1 such that for all 
functions /o, /i, . . . , /fc G L°°{fi), the averages over [Mi, Ni) of 

- /"e(/o I Zg) ■ TPi(")E(/i \Zs) ■■...■ TP'=(")E(/fc I Zg) dfi 

converge to zero for all sequences (Mj) and (A^j) of integers such that 
Ni — Mi — )■ +00. Proceeding as in the proof of the deduction of Corol- 
lary 4.5 from Theorem 4.4 of ^J, we deduce that 

- JEifo I Z,+i) ■ TP^We(/i I Z,+i) ■ . . . ■ TP'=(")E(/fc I Z,+i) dfi 



converges to zero in uniform density. By hypothesis and Corollary 13. 2[ 
we have that Z^+i = Zi. Applying this with fo = fi = ■ ■ ■ = fk = ^a, 
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and writing g = E,{1a \ Zi), we conclude that 

fi{A n T~pi("M n ■ ■ • n t-p'=("M) - f g- tpi(")^ ■■...■ t^^w^ dfi 

converges to zero in uniform density. 

We continue as in the proof of Theorem 13.41 Let u denote the Haar 
measure of Zi and let a be the element of Zi defining its transformation. 
For t and z & Zi, write gt{z) = g{z+t); choose an open neighborhood U 
of in Zi such that US'* — 5'||li(!/) < s/k'^ for every t & U. We now use a 
standard equidistribution method. Let H be the closed subgroup of T*^ 
spanned by (pi(n)t, . . . ,Pk{n)t) for n G Z and t G T. We have that H 
is equal to the set of (ti, . . . , t^) G T'^ such that aiti + . . . + a^tk = for 
every choice of (ai, . . . , a^) G Z'^ such that aipi + . . . + akPk is identically 
zero. By Weyl's Theorem [36], the sequence (^pi{n)a, . . . ,pk{n)a) is 
well distributed in H, meaning that for every continuous function on 

N+M-l 

■Vj: X] 4>{{Pi{n)a, . . . ,pk{n)a)) — > / (pdmn, uniformly in M, 

n=M -^ 

where rriH denotes the Haar measure on H. We deduce that for the 
open set U, 

{pi{n)a, . . . ,pk{n)a) e U x . . . x U 
for a syndetic set of n. We conclude as in the proof of Theorem 13.41 D 

3.3. Second application: weighted multiple averages. The sec- 
ond application is a strengthening of results in [22] and j7] on the 
convergence of weighted polynomial multiple averages. Recall that the 
Kronecker factor of an ergodic system is naturally endowed with a 
topology, making it a compact abelian group. 

Theorem 3.5. Let {X,T) be a uniquely ergodic topological dynami- 
cal system with invariant measure fi. Assume that {X,fi,T) satisfies 
Zs{X) = Zi{X) for all s > 1, for example a system satisfying one 
of the properties listed after Corollary \3.2i and that the projection tti 
of X onto its Kronecker factor is continuous. Then for any Riemann 
integrable function (p on X, any x E X, any system (y, z/, S*), any 
k > 1, any functions fi, ■ ■ ■ , fk £ L°°{^), o-nd any integer polynomials 
Pi, . . . ,Pk, the averages 

N-l 
ra=0 

converge in L'^{i') as N ^ +oo. 
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In particular, this result applies for substitution dynamical systems |3l 
and more generally for many linearly recurrent systems and systems of 
finite topological rank |6]. In |22], it was proved in the case of lin- 
ear polynomials for particular sequences, including, for example the 
Thue- Morse sequence. 

Proof. Assume that {X, T) is a uniquely ergodic system satisfying the 
hypotheses of Theorem 13.51 Since for every s > 1 we have that 
Zs{X) = Zi{X), by hypothesis the projection vr^: X — )■ Zs{X) is con- 
tinuous. Theorem 13.51 follows immediately by combining two results in 
the literature. The first is a weighted ergodic average for nilsequences: 

Theorem ([22], Theorem 2.19 and Proposition 7.1). Let {X,T) be a 
uniquely ergodic system with invariant measure ^ and let m > 1 be an 
integer. Assume that the factor map vr^: X — )■ Zs{X) is continuous. 
Then for any Riemann integrable function cf) on X , x (z X , and s-step 
nilsequence h={hn'-n&'L), the limit 

1 ^-1 
hm ^0(T"x)6„ 



AT^+oo X 

n=0 



exists. 



The second is a weighted ergodic theorem for multiple convergence 
along integer polynomials proved by Chu that generalized the linear 
case in 1221: 



Theorem ([7], Theorem 1.3). For any k,d E N, there exists an in- 
teger s > 1 with the following property: for any bounded sequence 
a = (a„ : n G Z), if the averages 
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ra=0 



converge as N ^ +oo for every s-step nilsequence b = {bn'- n E 1j), 
then for every system {Y, u, S), all fi, . . . , f^ G L°°[v), and all integer 
polynomials pi, . . . ,pk of degree < d, the averages 

N-l 



N 

n=0 



converge in L'^{i') 



D 
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4. Complexity of topological nilsystems 

4.1. Complexity and commutator dimension. Before stating the 
theorem, we define: 

Definition 4.1. If {X = G/T, T) is an s-step nilsystem for some s > 1 
and if r G G is the element defining T, the total commutator dimension 
p of X is defined to be 

s-l 

(6) p = y^ dim (range (Ad,- — id) ) ) . 

Our main result is: 

Theorem 4.2. Let {X = G/r,T) be a minimal s-step nilsystem for 
some s > 2 and assume that {X,T) is not an {s — l)-step nilsystem. 
Let dx be a distance on X defining its topology. Then for every e > 
that is sufficiently small, there exist positive constants C{e) and C'{e) 
such that the topological complexity S{e, n) of {X, T) for the distance 
dx satisfies 

(7) C{e)n^ < S{e,n) < C\e)n^ for every n > 1, 

where p is the total commutator dimension of X. Moreover, p> s — 1 
and C{e) — ?■ +oo when £ — )■ 0. 

Furthermore, for a suitably chosen distance on X , one can take 
C{e) = Ce~'^ and C'{e) = C'e~'^, where C and C are positive con- 
stants and d is the dimension of X . 

The result can be translated into the language of complexity of covers 
studied in [5j. We start by reviewing the definition. Let U be an open 
cover of X, and for every integer ra G N, write 

n-l 

Wo"-i := Y T-m. 

j=0 

Define 'N{L{,n) to be the minimal cardinality of a subcover of W^~^; 
the complexity function of U is the map n i— )■ y^{U,n). We have that 
^{SiU) < c{e) for every £ > is equivalent to yi(lA,n) < C'iU) for 
every open cover U of X, and these conditions are equivalent to the 
system being a rotation. The upper bound in ([7]) can be rephrased 
as saying that for every open cover U of X, there exists a constant 
C'{U) with K(W,n) < C'lu)nP for every n G N. The lower bound 
means that there exists an open cover U and a constant C{U.) such 
that Ji{U, n) > C{U)nP for every n G N. 
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Corollary 4.3. Let s > 1 and let (X = G/T, T) he a minimal nilsystem 
that is not an s-step nilsystem. Then 

liminf — S{6, n) — )■ +oo as £ — )■ 0. 

Applications of these results are given in Section El 

4.2. Conventions and notation. In the sequel, s > 2 is an inte- 
ger and {X = G/r,T) is a minimal s-step nilsystem that is not an 
(s — l)-step nilsystem. We let r denote the element of G defining the 
transformation T. 

As explained in Section 12. 2[ the assumption of minimality allows us 
to assume that G is spanned by the connected component Gq of the 
unit element Iq and r. We make further assumptions on the choice of 
the presentation G/T of the nilsystem X and then prove Theorem 14.21 
under these additional assumptions. This clearly implies the result in 
the general case. 

Let fi denote the Haar measure of X and let A denote the Haar 
measure of G. We normalize A such that the measure of any (Borel) 
fundamental domain of the projection tt: G — )• X is equal to 1. 

Here, and again in Section 15. ![ we impose conditions on the dis- 
tance dx on X defining the topology on X. Again, the conclusions of 
Theorem 14.21 remain valid for a general distance defining the topology. 

Throughout the proof, we often fix some e > and assume that e 
is sufficiently small. This means that it is smaller than some constant 
depending only on the nilsystem (X = G/r,T) and the distance dx 
defined on it, and not on any other parameter such as the integer n. 

Finally, we choose a bounded Borel fundamental domain D of the 
projection vr: G — )■ X. 

4.3. Some preliminaries. 

4.3.1. Choosing a distance on X . First, we choose a distance da on the 
group G that defines its topology. For the moment, we only assume 
that this distance is invariant under right translations, meaning that 
for all g,g',h g G, 

dcigh.g'h) =dGig,g'). 
The nilmanifold X is endowed with the quotient distance, meaning 
that for x,y E X, 

(8) dxix, y) = mi{dG{g, h) : ^{g) = x and Tr{h) = y}. 
In other words, for all g,h E G, we have 

(9) dxU{g),'K{h)) = ini dG{ga,hf3) = midG{g,h-f). 
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Since the inverse image under vr of every point of X is discrete, the 
infimums in these last two formulas are attained. 

We recall that for 2 < j < s, Gj and GjT are closed subgroups of 
G and that F n Gj is a cocompact subgroup of Gj. In particular, we 
deduce that there exists 5o > such that for 2 < j < s, 

(10) if 7 G r is within a distance 5q of Gj, then 7 G Gj fl F. 
In particular, we deduce that 

(11) if dci^g, g') < 5q and -n^g) = n{g'), then g = g' . 

4.3.2. Commutators. The following lemma is used to prove that the 
exponent p of Theorem 14.21 is > s — 1 . 

Lemma 4.4. For \ <i < s — \, we have that [r, G(\ ^ G^+2- 

Proof. Let G' = G/G^+2, vr: G — ?■ G' be the quotient homomorphism, 
F' = 7r(F) and r' = 7r(r). Let V be the translation by r' on X' = G'/T'. 
Then for every j > 1, we have that vr(Gj) = G'-. Therefore G^_,_2 — 
{1g'} and (X',T') is an {i + l)-step nilsystem. But G^+i 7^ G^+2 and 
so G^^^ 7^ {Ic}- Thus (X',T') is not a ^-step nilsystem. Moreover, 
since {X',T') is a factor of (X, T), it is minimal. 

We remark that 7r([r, G^]) = [r',G£]. Therefore, substituting X' for 
X, we are reduced to showing that r does not commute with G^ when 
X is a (£ + l)-step, but not £-step, nilsystem. 

Assume that r commutes with Gg. Consider the commutator map 
G^ X Go — )■ G^+i. Since G is (£ + l)-step nilpotent, by ([T]) this map 
is multiplicative in each coordinate separately. But by ([2]), [G^,G2] is 
trivial and so this commutator map induces a continuous map G^ x 
G0/G2 — )■ G^+i. Finally, G^ and G0/G2 are abelian and so this contin- 
uous map is bilinear. 

On the other hand, by minimality the subgroup of G spanned by 
F and r is dense in G. This and the hypothesis imply that [G^ fl 
F, G] = [Gi n F, F] C F. Therefore, for 7 G G^ fl F, the map g 1— )■ [7, g] 
continuously maps the connected group Go to the discrete group F, and 
so this map is trivial. We conclude that G^ fl F commutes with Gq. 

Therefore, the commutator map G^ x Gq — )■ G^+i induces a bilinear 
continuous map 

Let X'- G^+i — 7- T be a character of the abelian group G^+i. Then 
X o ■?/;: Gi/{Gi n F) X G0/G2 — >■ T induces a continuous group homo- 
morphism from group G0/G2 to the dual group of the compact abelian 
group Gi/i^Gi n F). Since this dual group is discrete and G0/G2 is 
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connected, this group honioniorphisni is trivial. It follows that % o ?/; is 
the trivial map. 

As this holds for every character x of G'^+i, ip is the trivial map. 
Combining this with definition (IT2!) . it follows that the commutator 
map G^ X Go — !■ G^+i is trivial, and so G^ commutes with Gq. 

By assumption, G is spanned by Go and r and Gi commutes with r. 
It follows that Gi is included in the center of G, that G^+i is trivial, 
and thus that G is £-step nilpotent, a contradiction. D 

4.3.3. Some linear algebra. We make use of the following estimate from 
linear algebra. The proof is postponed to Appendix [Bl 

Proposition 4.5. LetW^ he endowed with the Euclidean norm ||-|| and 
let the Lehesgue measure of a Borel subset K of W^ he written \K\. 
Let A be a d X d matrix and assume that it is unipotent, meaning that 
(id— Ay = 0. For every integer n >2, let 

Wn = {^eR'^: \\A''^\\ <1 forl<k<n}. 
If 

d-l 

p = 2^ dim(range(id —A) ), 
fc=i 
there exist positive constants C and C' (depending on d and on A) such 
that 

(13) Cn-P < |W„| < C'n-P 
for every n. 

AA. Reduction to a local problem. Throughout this section, n > 1 
is an integer. We assume that e is given and is sufficiently small, Xq is 
a point in X, and h ^ D is chosen such that 7r(/i) = xq. 

For the moment, we view xq and h as fixed, but it is important that 
the bounds do not depend on xq G X, and thus also not on h E D. 

We study the set V" C X defined by 

V = {x EX: dxiT''xo,T''x) <eioiO<k<n}. 

Since the infimum in the definition of dx (see ([9])) is attained, for 
every x E V there exists g E G with 

(14) x = g-xo anddcigAo) = dx{x,Xo). 

Furthermore, for sufficiently small e, it follows from ( ITT]) that these 
conditions completely define g. Let W denote the set of elements g 
associated in this way to points of V. Again, the choice of small e 
implies that W is included in the connected component Go of 1^ in G. 
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Convention. In the sequel, we often view x, and so g, fixed. When 
needed, we emphasize the dependence of (7 on x satisfying ( IH]) by 
writing g{x) instead of g. 

By hypothesis and (Q, for < A; < n, there exists 7^ G F with 

(15) dcir^h^k, T^gh) = dx{r^ ■ xq, r^g ■ xq) = dxiT'^x^, T^x) < e 

and, by (ITT]) again, this 7^ is unique. Clearly, 70 = 1g- 
In the remainder of this section, we show: 

Lemma 4.6. Let xq, x, g, and 7^ he as above. Then 7^ = 1g for 
< k < n. 



4.4.1. Initial computations. From the characterization (jTSJ) of 'jk, it 
follows that for < A; < n, 

e>dG{[T^h,-fkhkr''h, r'^gh) 
= dG{[T^h,'yk\lkT'^ , T'^g) by right invariance 
= dG{[T'h,^khkr',g[g-\T']T') 

= (iG([r''/i,7fc]7fc , g[g~^,r'']) by right invariance (again). 
Since ddg, 1g) < ^) we have that 



n. 



(16) rfG(r/^,7fc]7fc , [g~\A) <2EioiO<k< 
We proceed by induction on the degree s of the s-step nilsystem X. 

4.4.2. The case s = 2. Since G2 is included to the center of G, it 
follows from ([1]) that the map (x, y) H- [x, y] from G x G to 6*2 is 
bilinear and thus [g^^,T'^] = [g,T]^'' = [t, f?]^ for every k. Moreover, 
[t'^H, 7fc] belongs to the center of G, and Equation (fT6l) can be rewritten 
as 

(17) dci-fk , [r'=/i,7fc]-i[r, (^l'^) < 2^ for < A; < n. 

Therefore, for < A; < ra, we have that 7^ is within distance 2e of G2- 
By Remark (fTOj) . we have that 

7fc e G2 n F, 

and in particular 7^ belongs to the center of G. Then flT7|) implies that 

(18) rfG(7fc,[^,^]')<2£forO<A;<n. 

Since ^^•(s'j 1g) < ^; it follows that (^(^([r, (;], l^r) < Ce for some 
C > 0. Thus dci'yi, 1g) < (C* + 2)e. Since F is discrete and e is small, 

7o = 7i = 1g- 
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On the other hand, by (IT8l) again, 

dG{lklklilk+2 , Ig) < 8^ ioi < k < n - 2. 

Since F is discrete and e is small, 

lklk+ilk+2 = IcioY <k <n-2. 

We deduce that 7jt = 1^ for < A; < n, as announced. 

4.4.3. The general case. Assume that X is an s-step nilsystem for some 
s > 2 and that the result has been proven for an [s — l)-step nilsystem. 
Maintaining the same notation and conventions as above, we show that 
7fc = 1 for every k. 

Taking the quotient by Gg, the induction hypothesis implies that 

7fc G Gs for 1 < k < n 

and in particular 7^ belongs to the center of G. For every /c G Z, we 
write 

(19) g{k):=[g-\r% 

By the estimate in Equation (flGl) . we have that 

(20) dci^k , g{k)) < 2e ioT < k < n. 

A new difficulty arises here that does not come up for s = 2, as in 
general g{k) does not belong to the center of G. We begin by showing 
that the map k \-^ g[k) is a polynomial map from Z to G2 (see |i25j ) . 
The computations for this are fairly explicit and we include them. 

First, since ddg, 1g) < £, there exists a constant C > such that 
dG{[g-\ t''], Ig) <CehiO<k<s-l and, by definition ([HD of g{k) 
and ([20]), ^0(7^, 1g) < (C + 2)^ for < A; < s - 1. Since F is discrete 
and e is small, we deduce that 

(21) 7fc = 1g for < A; < s - 1. 

On the other hand, recall that the difference operator D maps a 
sequence {h{k) : k > 0) with values in G to the sequence {Dh{k) : k > 0) 
given by 

Dh{k) = h{kY^h{k + 1) for every A; > 0. 

Applying this definition to the sequence {^g{k) : A; > O) defined by (IT^ . 
it is easy to check by induction that for every j > 1, 

D^g{k) = T^VjT-^, 

where 

^1 = [fl-^S^] and Vj+i = [vJ^,t]. 

Therefore, for every j > 1 we have Vj G Gj+i and D^ g{k) G Gj+i 
for every k. (In the vocabulary of Leibman [25], this means that the 
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sequence {g{k): k > Oj belongs to the class y(o,i,2,...)^-) ^^ particular 
Vs = 1g and 

D^g(k) = 1g for every A; > 0. 

By the definition of the difference operator and by induction, we have 
that the sequence {g{k): k > 0) satisfies a recurrence relation of the 
form: for every A; > 0, 

Ig = D'g{k) = g{k + miy^g{k + m^y^^ . . . g{k + maO'"' , 

where 

(22) < rrij < s and rjj = ±1 for 1 < j < 2^ ; m2s = s and 772" = 1- 

Since 7^ belongs to the center of G for every i, 






= dG[{g{k + m,h-l^J^^ . . . {g{k + m,s)rkl^^J^' , 1g 

< J2M9ik + mjh,l^^ , 1g) < r^'e by m- 
i=i 
Since F is discrete and e is small, we deduce that 

(23) ^tm. . . . 7fcT„,. = 1g for every k > 0. 

In other words, the s^^ iterated difference [D'^'-jk) of the sequence (7^ : k > 
0) is trivial, and this sequence is a polynomial sequence in the abelian 
group Gs n r. 

Recalling that 1712^ = s and 772^ = 1, combining (^T^, f l25]) and fl22]) 
and using induction, we have that 7^ = 1^ for < /c < n (see also 
Proposition 3.1 in |25]). 

This concludes the proof of Lemma 14.61 D 

4.4.4. A summary. 

Corollary 4.7. Let xq G X and h & D with 7r(/i) = xq. Define 

V = {xeX: dxiT^XQ.T^x) <e forO<k<n}. 
Then, for every x & V , there exists a unique g = g{x) G G satisfying 

X = g -Xq and d^g, 1) = dxix, Xq). 
We have 

(24) dcir^gr-^ Ic) < e for < k < n. 
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The last formula follows from ( TTSl) , Lemma 14.61 and invariance of the 
distance under right translations. 

5. Proof of Theorem 14. 2t computing the complexity 



5.1. Working in the Lie algebra. As discussed in Section [2.21 we 
can assume that the connected component Go of the identity 1g is 
simply connected. 

As we consider neighborhoods W of Iq such that dcig, 1g) < ^ 
for every g G W, we have that W C Gq. Thus it suffices to give a 
description of Gq. 

Recall that the exponential map is a homeomorphism from the Lie 
algebra Q onto Gq. For every j > 1, we denote the Lie algebra of Gj 
by Qj and the exponential map takes Qj onto Gj. 

Let Q be endowed with some Euclidean norm ||-|| and some orthonor- 
mal basis for this norm. We use this basis to identify Q and M'*. 

Let the distance da on G be the right invariant Riemannian distance 
such that the associated norm on the Lie algebra Q is ||-||. We have 

(25) dciexpiO, 1g) = ll^ll for every ^ G Q. 

As before, we denote the Lebesgue measure of a subset £ of ^ by |£|. 
In general, the exponential map does not take the Lebesgue measure 
of Q to the Haar measure A of G. But, because all elements g under 
consideration belong to the compact subset 

K:={geG:dG{g,lG)<l} 

of G, the density (with respect to A) of the image of Lebesgue mea- 
sure under the exponential map is bounded above and below by some 
positive constants G and G': 

(26) for every L G K, 

G\{^eg: expire L}\ < A(L) <G'\{^eg: expire L}\. 

We recall that the linear map Ad^ : Q ^ Q is defined to be the 
differential of the map g i— )■ rgr^^ from G to itself, evaluated at the 
point Ig- 

We have that 

(27) rexp(^)r~^ = exp(AdT-^) for every ^ E G. 

Notation. We write $,-: G — )■ G for the map g i— t- rgr^^g^^. 

The differential of $,- evaluated at Iq is Ad,- — id: ^ — t- ^. Since G 
is s-step nilpotent, the s^^ iterate of $,- is the constant map Iq. Thus 

(Ad, - id)" = 0, 
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meaning that the map Ad^ — id is nilpotent. More precisely, for every 
fc > 1, we have that $^(6"^) C Gk+i and thus (Ad,- — id)^^ C Gk+i- 



Lemma 5.1. The total commutator dimension p (Definition 4-1) of X 
satisfies 

p > dim (range (Adr — id)) > s — 1. 

Proof. For 1 < i < s — 1, the restriction of the map ^^ to G^ maps this 
group to Gi+i. Composing it with the factor map G^+i — t- G'^+i/G'^+2 
we obtain a map \I/^: G^ — )■ G^+i/G^+2- By ([1]), this map is a group 
homomorphism. By Lemma 14. 4[ this homomorphism is not trivial 

Now we take differentials at the unit element Iq of G. The differential 
of $T- at 1g is Adr — id, the differential at 1^ of the quotient map 
G^+i -^ Gi+i/Gi,+2 is the quotient map pf. Qi+i -^ Qi+i/Qi+2, and 
thus the differential of "$£ at this point is pi o (Ad,- — id). Since the 
group homomorphism \&^ is not trivial, its differential at 1^ is not zero 
and thus p£ o (Ad,- — id) is not trivial. 

We conclude that (Ad,- — id)^^ ^ Gi+2- Since (Ad^ — id)^^+i C Ge+2, 
it follows that (Ad,- — id)^£ ^ (Adi- — id)^£+i. Thus 

dim((Ad^ - id)^^) > dim((Ad^ - id)^^+i) + 1. 

Summing this inequality for l<£<s — 1, we conclude that 

dim((Ad^-id)^) > s- 1. 

n 

5.2. Bounding the complexity. 
Notation. Set 

and note that /C is a compact subset of Q. Set 
Wn := {e e g-. IIAd^ell < 1 for < A; < n} ; 

W,,n:={exp{0:^eeWn} 

= {g EG: dair'^gr-'', Iq) < £ ior < k < n} by 1^ and 1^. 

(In the last line, we can write g E G instead of Go, as e small implies 
that all points of G at a distance < e of l^ belong to Gq.) 

Since the matrix of Ad,- is unipotent, by definition (^ oi p and 
Proposition 14. 5t we have that Ge'^'n"^ < {eWnl — G'e'^n'^. Thus, since 
We^n C K,hj (^ it follows that: 

Corollary 5.2. For every sufficiently small e > and every n eN, 

(28) Ce'^n'P < A(Vr,,„) < C'e'^n-P. 
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5.2.1. Lower bound for the complexity. We use this description to prove 
the lower bound of Theorem 14.21 

Assume that e > is sufficiently small and that {xi, . . . ,xn} is an 
e-n spanning set for X. Let 

Vj = {xeX: dx{T''xj,T''x) < e ioi < j < n}. 

By hypothesis, the union of the sets Vj cover X. 

For 1 < j < N, choose hj G D such that vr(/ij) = Xj. We are in the 
setting of Corollary 14. 71 For every j > 1 and every x & Vj, there exists 
a unique gj{x) G G with x = gj{x)-Xj = 7i{gj{x)hj) and dc{gj{x), Iq) = 
dx{xj,x) < e. By ([21]), we have that dG{T^gj{x)T~^,lG) < e for < 
k < n. In other words, by definition of the set W^^n, we have that 
gj{x) G W^^n- It follows that 

7r{\JW,^nhj)=X. 

Therefore, by choice of the normalization of A, 

1 < A(|J W.^nh,) < iVA(H^,,„) < NC'e'^n-P, 
i=i 

where the inequality follows from Corollary 15.21 We conclude that 
N > Ce~'^n^ for some constant C. D 

5.2.2. Upper bound of the complexity. Recall that D is a Borel funda- 
mental domain of the projection vr: G — )■ X. Let Di be a compact 
subset of G containing all points at a distance at most 1 from D. We 
make use of the definition of complexity using e-n separate sets (Sec- 
tion [L2]) to show: 

Lemma 5.3. Assume that e < 1. There exists a subset {hi, . . . , h]\f} 
of D with 

(29) N < X{Di)X{W,/2,n)-' 
such that 

N 

(30) Dg[J We,nhj. 

i=i 

Proof. Let A^ be the maximal cardinality of a subset {hi, . . . , hN} of D 
such that the sets We/2,nhj are disjoint. 
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Since every element g G Ws/2^n satisfies dcig, 1g) < ^/2 < 1, all the 
subsets Ws^nhj are contained in Di. Therefore, 

N 

i=i 

We claim that the set {hi, . . . ,hN} satisfies (130|) . Assume instead that 
this does not hold and that h & D does not belong to this union. It 
follows immediately from the definition that W^i2^n is symmetric and 
that We/2,n ■ We/2,n C W^^n- Therefore, for I < j < n, since h ^ W^^nhj, 
we have that Ws/2,nhj fl We/2,nh = 0. 

Setting /liv+i = h, we have that the set {hi, . . . ,hN, /iat+i} satisfies 
the imposed condition, contradicting the maximality of A^. D 

We now show that the upper bound of Theorem 14.21 holds, thereby 
completing the proof. Let e > be sufficiently small. Let A^ and 
{hi, ... , h]\f} be defined as in the conclusion of Lemma [5.31 

Let Xj = TT{hj) for 1 < j < A^. We claim that {xi, . . . , x^} is an e-n 
spanning set for X. 

Let X E X and h E D he such that 7i{h) = x. There exists j with 
1 ^ J < A^ such that h G W^^nhj, meaning that there exists g G W^^n 
with h = ghj. For < k < n, we have 

dx{T''x,T''xj) = dx{'K{T''gh,),Tx{T%)) < dG{T''gh„T%) 

k„ ^k\ J /-[ ^k „^~k\ 



dGir'g,T') = rfG(lG,rV ) < £ 



since g G W^^n- This proves the claim. 

On the other hand, by (129|) and Corollary 15.21 , we have that A^ < 
(j^-d^P fQj, some C > 0, concluding the proof of Theorem 14.21 

6. Some topological applications 

As in the ergodic setting, we find classes of systems such that all 
factors Ztop,s(-^), s > 1, of {X,T) are equal to Ztop,i(X). 

It follows directly from Corollary 13.21 that transitive systems of fi- 
nite topological rank satisfy this property; in particular, this is the 
case for minimal substitution dynamical systems and minimal interval 
exchange transformations. Namely, let {X, T) he such a system with 
nilfactor {Y, S). Since {Y, S) is transitive, it is uniquely ergodic (see [1] 
and |30j), and its invariant measure is the Haar measure z/ of Y. Let 
H he an invariant ergodic measure on (X, T). Then {X,fi,T) is a sys- 
tem of measure theoretical finite rank, and the topological factor map 
X — > y is also a measure-theoretic factor map. Thus by Corollary 13. 2[ 
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{Y, u, S) is measure theoretically isomorphic to a rotation. By the rigid- 
ity properties of nilsystems (see for example [231 Appendix A]), {Y, S) 
is topologically isomorphic to this rotation. 

Corollary 14.31 can also be used to find other such classes: 

Proposition 6.1. Let {X,T) be a transitive subshift and assume that 

\im.mf —Qx{n) < +oo. 

Then (X, T) does not admit any topological nilf actor other than rota- 
tions. Therefore, for every s > 1, the topological factor Ztop,s{X) of X 
is equal to its topological Kronecker factor Ztop,i(X). 
More generally, if for some s > 1 we have 

(31) limini —Gxin) < +cx), 

then {X, T) does not admit any nilsystem as a topological factor that 
is not an s-step nilsystem. Therefore, for every t > s, the topological 
factor Ztop^t{X) of X is equal to Ztop,s(X). 

Proof. The last statement follows immediately from the fact that Ztop,s{X) 
is the inverse limit of all s-step topological nilfactors of (X, T). 

Assuming (IHT]) . it suffices to show that there is no topological factor 
map 0: (X, T) — )■ {Y,S), where {Y,S) is a minimal {s + l)-step nil- 
system that is not an s-step nilsystem. Indeed, any minimal nilsystem 
that is not an s-step nilsystem admits an (s + l)-step nilsystem as a 
factor that is not an s-step nilsystem. 

Assume instead that such a factor map : (X, T) — )• {Y, S) exists 
and let dy be a distance on Y defining its topology. Recall that (X, T) 
is a transitive subshift on the finite alphabet A. We write x G X as 
X = {Xn : n G Z), and if / C Z is a finite interval, then we write xj for 
the finite sequence {Xn'. n & I). 

Let e > 0. Since is continuous, there exists an integer L = L{e) > 
such that dY{(f){x),(j){y)) < e whenever x,y E X satisfy X[-^l,l] = 
y[^L,L]- Therefore, ii x,y e X satisfy X[-L,n+L] = y[-L,n+L] for some 
n > 1, then dY{T''(f){x),T''(j){y)) < e for every A; G {0, . . . , n}. 

Fix n > 1 and set m = Gxin + 2L + 1). 

By definition, there exist m elements x^^^ , ■ ■ ■ , x^"^^ of X such that 
for every x G X, there exists j G {1, . . . , m} with X[^L,n+L] = X\_j^ n+Ll- 
By definition of L and since is onto, the set {(pix^^^) : 1 < j < rn} is 
an e-n spanning set of [Y, S). Thus 
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Since this holds for every n > 1, it follows from the hypothesis that 

liminf — §y,5,dy(£,^) < liminf — ex(ri) < +00. 
But this contradicts the statement of Corollary 14.31 D 

Remark. Under the hypothesis of "linear complexity," that is, that there 
exists a constant c > such that Gx{n) < en for every n G N, the first 
statement of Proposition 16.11 can also be deduced from Corollary 13. 2[ 
by the method discussed at the beginning of this section; in this case, 
the system {X,T) has topological finite rank [12j . 

Appendix A. Proof of Proposition 13.11 

For convenience, we repeat the statement of Proposition I3.lt 

Proposition. Let {X = G/r,n,T) be an ergodie 2-step nilsystem that 
is not a rotation. Then L'^in) can be written as the orthogonal sum 
L^(/i) = "H © "H' of two closed T -invariant subspaces such that the 
restriction of T to Ti has discrete spectrum and its restriction to Ti' 
has Lebesgue spectrum of infinite multiplicity. 

A.l. Reductions and notation. Before proving the proposition, we 
make some usual reductions that present the nilmanifold in a stan- 
dard way. Note that these reductions are not the same as those of 
Section O 

Let (X = G/r,fi,T) be an ergodie 2-step nilsystem that is not a 
rotation. As in Section [521 we write r & G for the element defining the 
transformation T and Gq for the connected component of the identity. 
By minimality we can assume that the subgroup spanned by Gq and r 
is dense in G. This implies that G2 is connected and thus included in 

Go- 

Let r' be the largest normal subgroup of G contained in P. By 
substituting G/V for G and P/P' for P, we reduce to the case that P 
does not contain any normal subgroup of G. Thus the action of G on 
X is faithful and it follows that P is abelian and that G2 is compact. 
Since this Lie group is abelian and connected, it is a finite dimensional 
torus. Furthermore, it follows that each of subgroups spanned by [r, P], 
by [r, G] and by [r, Gq\ is dense in G2. 

Let q: G ^ G/G2 and -k: G ^- X denote the quotient maps. We 
recall (see [30] and [26]) that the Kronecker factor of X is the compact 
abelian Lie Group Zi = G/G2T, endowed with its Lebesgue measure 
mzi and translation hj a = p o tt{t), where p: X — >■ Zi denotes the 
factor map. 
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We remark that for every a E G, the map g i— t- [a, g\ is a group 
homomorphism from G to G2 (see Section 12. 2p and that the kernel of 
this homomorphism contains G2- 

A. 2. Lebesgue spectrum. For x ^ G2, set 

"H^ = {/ G L'^ifi)'- f{u-x) = x{u)f{x) for every u & G2 and /i-a.e. x}. 

Each space "H^ is invariant under T and L'^in) is the orthogonal sum of 
these spaces. We have that "Hi is the space of functions that factorize 
through Zi, and each space T-L^ is invariant under multiplication by 
functions belonging to L°°{ji) fl "Hi. 
We show: 

Lemma A.l. Let x be a nontrivial character of G2- Then the spectral 
measure associated to any function in "H^ is absolutely continuous with 
respect to the Lebesgue measure mj ofT. 

Proof. Let % be a nontrivial character of ^2- 

The group homomorphism g ^^ xlo^ ^] factorizes through G/G2 and 
so there exists a character x of G/G2 satisfying 

(32) x[9. t]=xo qig) for every g eG. 

Let 7 G r. Since 7 commutes with F and with G2, the map g 1— ;■ [7, g] 
factorizes through the Kronecker factor Zi. Thus there exists a group 
homomorphism 7: Zi — )• G2 such that 

(33) 7 o p o 7r(5f) = [7, g] for every g E G. 

Let i^ be a compact subset of G such that the restriction to K of 
the projection vr: G — >■ X is onto. For every n G Z, we choose 7^ and 
gn with 

(34) 7n G F, gne K and r" = 5f„7„. 

The family L of characters Xm,n of G/G2 defined by Xm,n ° Qig) = 
x[9i9ngm^] is a bounded subset of G/G2- But the closed subgroup of 
G/G2 spanned by x is not compact; if not, the restriction of x to some 
open subgroup H of G/G2 would be trivial and so the restriction of 
X o g to Go would also be trivial, implying that x{[9i 'A) ~ '^ for every 
g E Go and x would be trivial (again, the subgroup spanned by [r. Go] 
is dense in G2), a contradiction. Thus this group is discrete and in 
particular the set 

A := {/c G Z: x'" ^ L} 
is finite. 
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For every n & Z, write 

(35) On = X°fn- 
We claim that for every 6 & Zi, 

(36) \{neZ: 9^ = 9}\ < |A|. 

To check this, let m and n be integers with 6m = 9n- By definitions (p5l) 
and ([33]), for every ^f G G we have x{bm,9]) = xibn,9]) and, by 
the choice ([M]) of 7„ and ^f^, we have^x(T""'"'\fi']) = x([fi'nfl',„\fl']) = 
Xm,n °(l{g)- By fl32|) . we conclude that ^^"^-"^ = Xm,n £ A and the claim 
follows. 

Let / e ?^j^ be a function belonging to the space C''{X) of fc-times 
differentiable functions on X, for some k to be defined later. Note that 
for every n G Z, the function x i— )■ /(^f^ • x) ■ f{x) belongs to the space 
Til and can be written as /i„ o p for some function h^ on Zi. Since all 
the elements g^ belong to the compact subset K oi G and the action 
of G by translation on C^(X) is continuous (with respect to the usual 
topology of C^(X)), all functions x i— )■ f{gn ■ x) ■ f{x) belong to some 
compact subset of C^{X). It follows that all the functions /i„ belong to 
some compact subset of C^{Z\). Taking h to be sufficiently large, this 
implies that 

(37) V" sup|/i^(6')| < +00. 

eeZi 

The Fourier- Stieljes transform of the spectral measure a^ of / is 
given by: 



^A"^) '■= / /(^"a;) ■ f{x) dfi{x) = / fignln ■ x) ■ f{x) dfi{x) by ([3 



fignfn o Pix) ■ x) ■ f{x) diJ,{x) by ([33]) 
j X° In^ p{x) ■ f{gn ■ x) ■ f{x) dfi{x) by the definition of "H^ 
dn{z)hn{z) dmzj^{z) by (l35l) and the definition of /i„. 



We deduce: 
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neZ 

< ^|{n G Z: On = 0}\ sup|M^)| 



nSZ nEi 






< 



|A| Y^ sup|/5^(0)| < +CX) by (IMD and ([37]). 



eeZi 



Therefore, the spectral measure of / is absolutely continuous with 
respect to the Lebesgue measure rrij of T. By density, this property 
extends to every function in Ti^. D 

We use this to complete the proof of Proposition 13.11 

A. 3. End of the proof of Proposition 13. 11 . For x = 1; iio^e that a 
function belonging to "Hi has discrete spectral measure. 

By ergodicity, {a": n e Z} is dense in Zi. The group of eigenvalues 
of {X,T) is E := {6{a): 9 G Z}. Since Zi admits a (nontrivial) torus 
as an open subgroup, E is dense in T. 

Let X be a nontrivial character of G2- For every 9 E Zi, the space T-Ly^ 
is invariant under multiplication by the function 9 op^ as this function 
belongs to L°°{^) fl "Hi. For / G l-L^, the spectral measure oi f ■ 9 o p 
is equal to the image of af under translation by 9{a). Therefore, the 
maximal spectral type a^. of the restriction of T to T-L^ is quasi-invariant 
under translation by A for every A G -E, meaning that ii A cT satisfies 
cr^^A) = 0, then a~^{A + A) = 0. But a^^^ is absolutely continuous 
with respect to Lebesgue measure and so cr^ is equivalent to Lebesgue 
measure. It follows that there exists a function /^ G "H^ such that 

On the other hand, the invariant spaces Ti^, for x 7^ 1 G G2, are 
mutually orthogonal, completing the proof. D 

Appendix B. Proof of Proposition 14.51 
For convenience, we repeat the statement of Proposition 14.51 

Proposition. Let M"' be endowed with the Euclidean norm \\-\\ and let 
the Lebesgue measure of a Borel subset K of W^ be written \K\. Let 
A be a d X d matrix and assume that it is unipotent, meaning that 
{id—AY = 0. For every integer n > 2, let 

Wn = {^e R'^: \\A''^\\ <l forl<k<n}. 
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// 

p = 2^ dim(range(id —A) ), 
fc=i 

there exist positive constants C and C (depending on d and on A) such 
that 

Cn'P < |>V„| < C'n'P 
for every n. 

Notation. Let J^. denote the r^r upper triangular elementary Jordan 
matrix whose entries are given by 

1 for 1 < i < r and j = i; 
'^r,i,j — '{ ^ fo^^ I < i < r — I and j = i + 1; 
otherwise. 

In other words, the matrix J^. has I's on the diagonal and on the su- 
perdiagonal, and O's elsewhere. 

We begin with a lemma: 

Lemma B.l. There exists a constant C = C{r) such that 

(38) if \xj\ < — — - for some n> 2 and I < j < r, 

r 

then \y^{Jr)i,j^j ^ C for I < i < r and < k < n. 

On the other hand, there exists a constant C = C'{r) > such that 
(39) 

r 

< 1 for 1 < i < r and < k < n for some n> 2, 






C" 
then \xj\ < — — - for 1 < j < r. 



Proof. For A; > 1, 

k 



(40) {J^kj ,^_, 



where we make use of the convention that ()=0ifp<0orp>A;. 
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To prove the first statement, assume that Xi, . . . ,Xr satisfy the hy- 
pothesis of (138 p . Then for 1 < i < r, 

completing the proof of (I38p . 

The proof of (15^ requires more work. Assume that Xi, . . . ,Xr satisfy 
the hypothesis of (I39p . 

Taking /c = 0, we have that 

(41) \xj\ < 1 for 1 < j < r. 

Thus, without loss of generality, we can restrict ourselves to the case 
that n is sufficiently large, and assume that n > 2*" + 1. Define the 
integer g > 1 by 

(42) 2''q<n-l<2''{q + l). 

In the sequel, we only make use of hypothesis 0390 with i = 1 and 
k = 2"^q with < m < r. Formula (HUj) for the coefficients of the 
matrix J^ gives 



IE 



2'"g 
1 ^-^ 






< 1 for < m < r. 



Since ( g^) = 1 and |xi| < 1, by (HTj) we have that 



(43) 

Define 
(44) 



IE 

i=2 



2"'q 






< 2 for < m < r. 



P2,j{k) = -^ ( . _ ^ ) for 2 < j < r. 



Then p2j is a polynomial of degree exactly j — 2 in the variable k. 
Formula (H3|) implies that 



(45) 



lEP2,,(2™g) 
i=2 



< 



2™g 



for < m < r. 



We continue by induction and assume that for some i with 2 < i < r, 
we have 



(46) 






< 



C 



— for £ — 2 < m < r. 



(2™g)^- 
where P£j{k) is a polynomial of degree exactly j — i for i < j < r. 



(47) 
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The same formula applied with m — 1 substituted for m leads to 



^p,,(2™-ig)x, 



< 



(2-g)^-i 



for £ — 1 < m < r. 



For i + 1 < j < r, define 



Pe+i,j{k) = -j:{pe,j{^) -Pe,j{k/2)). 

Then pg^ij{k) is a polynomial of degree < j — i — 1 in the variable k. 
In fact, this polynomial has degree exactly j — £ — 1, as the coefficients 
of maximal degree oi p£j{k) and pej{k/2) are not the same. 

Taking the difference between the formulas (jl6l) and (H7j) . the con- 
stant term pi^i{2'^q) — pe^i{2^~^q) vanishes and we have that 



(48) I Yl P^+i.(2"^?) 

j=e+i 






< 



C 



(2™g)' 



for i — 1 < m < r. 



By induction, Inequality (H71) is proven for 2 < £ < r. The polyno- 
mial p^ £ is a nonzero constant and we have that 



(49) 






C 



(50) 



(2^g)'-i' 

By backwards induction, we now show that 

C 



\Xi\ < 



for 1 < £ < r. 



(2^g)^-i 

For i = r, this is exactly (H9|) . Assume that 1 < £ < r and that this 
bound holds for i + l,i + 2, . . . ,r. By (H^ applied with m = r, 



\Xi\ < 



{2rq) 



^ j+J2 IP^>^-(2'-g)| ^ 



(2^g)J-i' 



Since pij is a polynomial of degree j — i, we have \pij{k)\ < Cy~^ for 
some C > and (150|) follows. 

We conclude the proof by using (142|) to conclude that 2'"g > (n — 
l)g/(g + l) >n/4. D 

Corollary B.2. Let W he endowed with the supremum norm ||-||oo o-nd 
\K\ denote the Lehesgue measure of a Borel subset K ofW. Then for 
every n > 2, we have 



Cn~-{r~l)/2 < 



{xgM^ \\JrX\\oo <l forO<k<n} 



< C"n-"("-^)/2. 
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Using this, we complete the proof of Proposition 14.51 Since the 
matrix A is unipotent, there exists a d x d invertible matrix $ such 
that the matrix B := $A$~^ is in Jordan form. Thus all the entries of 
B are all equal to other than m > 1 diagonal square blocks, each of 
which is an elementary Jordan Vj x r^ matrix J^ for 1 < j < m. 

We have 

|W„| = |det($)|-i|$(>V„)| 
for every n. Moreover, 

^{Wn) = {r]eR'^: \\(^B''7]\\ < 1 for < A; < n} 

and thus there exist positive constants c and c' with 

c{t] G R"^: ||5^/||oo < 1 for < A; < n} C <I>(W„) 

C c'{r] G R"^: \\B''r]\\^ < 1 for < A; < n}. 

Therefore 



c' 



[r] G R'^: \\B''r]\\oo < 1 for < A; < n} < |$(W„)| 

{r/ G M'^: \\B^ri\\^ < 1 for < A; < n} 



<c'^ 



On the other hand, 
{?7 G M'^: ll^'^r^lloo < 1 for < A; < n} 

m 

= Y[{V e R"': WJr^vWoo < 1 for < A: < n}. 

i=i 

Combining these remarks with Corollary IB.2t we have that 

Cn-P < |W„| < C'n-P, 
where 
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m Tj 



n-l 



j=i j=i k=i 

m d— 1 

= Y^ ^ A;(dim(ker(id -Jr,f^^) - dim(ker(id -Jr,f)) 
i=i k=i 

d-l 

= ^ fc(dim(ker(id-v4)'=+^) - dim(ker(id -A)^)) 
fc=i 

d-l 

= Y^ dim(range(id -A'^)). D 

k=l 
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